NSF PAR Search | NSF Public Access Repository

Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models

Wang, Haoyu; Wang, Peihao; Li, Mufei; Liu, Shikun; Miao, Siqi; Wang, Zhangyang; Li, Pan (December 2025, NeurIPS 2025)

Full Text Available
Meta ControlNet: Enhancing task adaptation via meta learning

Yang, Junjie; Zhao, Jinze; Wang, Peihao; Wang, Zhangyang; Liang, Yingbin (March 2025, Conference on Parsimony and Learning (CPAL))

Full Text Available
Polynomial Width is Sufficient for Set Representation with High-dimensional Features

Wang, Peihao; Yang, Shenghao; Li, Shu; Wang, Zhangyang; Li, Pan (April 2024, OpenReview)

Set representation has become ubiquitous in deep learning for modeling the inductive bias of neural networks that are insensitive to the input order. DeepSets is the most widely used neural network architecture for set representation. It involves embedding each set element into a latent space with dimension L, followed by a sum pooling to obtain a whole-set embedding, and finally mapping the whole-set embedding to the output. In this work, we investigate the impact of the dimension L on the expressive power of DeepSets. Previous analyses either oversimplified high-dimensional features to be one-dimensional features or were limited to analytic activations, thereby diverging from practical use or resulting in L that grows exponentially with the set size N and feature dimension D. To investigate the minimal value of L that achieves sufficient expressive power, we present two set-element embedding layers: (a) linear + power activation (LP) and (b) linear + exponential activations (LE). We demonstrate that L being poly(N,D) is sufficient for set representation using both embedding layers. We also provide a lower bound of L for the LP embedding layer. Furthermore, we extend our results to permutation-equivariant set functions and the complex field.
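
The three-stage pipeline described in this abstract (per-element embedding into dimension L, sum pooling, output map) can be illustrated with a minimal sketch. This is not the authors' construction: the random weights, the fixed power `degree=2`, the linear readout `v`, and the function names `embed` and `deepsets` are all assumptions chosen for illustration. The embedding here is only loosely modeled on the "linear + power activation" idea.

```python
import math
import random

def embed(x, W, degree):
    # Linear map (one row of W per latent coordinate) followed by an
    # elementwise power activation. The degree is an illustrative assumption.
    return [sum(w * xj for w, xj in zip(row, x)) ** degree for row in W]

def deepsets(X, W, v, degree=2):
    # 1) embed each set element into a latent space of dimension L = len(W)
    # 2) sum-pool the embeddings into one whole-set embedding
    # 3) map the pooled embedding to a scalar output (here: a linear readout)
    pooled = [0.0] * len(W)
    for x in X:
        for k, z in enumerate(embed(x, W, degree)):
            pooled[k] += z
    return sum(vk * pk for vk, pk in zip(v, pooled))

random.seed(0)
N, D, L = 5, 3, 8  # set size, feature dimension, latent dimension
X = [[random.gauss(0, 1) for _ in range(D)] for _ in range(N)]
W = [[random.gauss(0, 1) for _ in range(D)] for _ in range(L)]
v = [random.gauss(0, 1) for _ in range(L)]

out = deepsets(X, W, v)

# Reordering the set elements should not change the output,
# up to floating-point rounding in the sum.
shuffled = X[:]
random.shuffle(shuffled)
out_shuffled = deepsets(shuffled, W, v)
print(math.isclose(out, out_shuffled, rel_tol=1e-9, abs_tol=1e-9))
```

Because the only interaction between elements is the sum in step 2, the output is insensitive to input order regardless of the weights; the abstract's question is how large L must be for such a model to represent set functions.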
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Wang, Zhendong; Jiang, Yifan; Zheng, Huangjie; Wang, Peihao; He, Pengcheng; Wang, Zhangyang; Chen, Weizhu; Zhou, Mingyuan (December 2023, Neural Information Processing Systems)

Diffusion models are powerful, but they require a lot of time and data to train. We propose Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training time costs while improving data efficiency, which thus helps democratize diffusion model training to broader users. At the core of our innovations is a new conditional score function at the patch level, where the patch location in the original image is included as additional coordinate channels, while the patch size is randomized and diversified throughout training to encode the cross-region dependency at multiple scales. Sampling with our method is as easy as in the original diffusion model. Through Patch Diffusion, we achieve ≥2× faster training, while maintaining comparable or better generation quality. Patch Diffusion meanwhile improves the performance of diffusion models trained on relatively small datasets, e.g., as few as 5,000 images to train from scratch. We achieve outstanding FID scores in line with state-of-the-art benchmarks: 1.77 on CelebA-64×64, 1.93 on AFHQv2-Wild-64×64, and 2.72 on ImageNet-256×256. We share our code and pre-trained models on GitHub.
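
The patch-level conditioning described in this abstract (a randomly sized patch plus coordinate channels giving its location in the original image) might look roughly like the sketch below. It is not taken from the authors' released code: the function name `sample_patch`, the candidate sizes `[4, 8, 16]`, the uniform choice among them, and the normalization of coordinates to [-1, 1] are all assumptions made for illustration.

```python
import random

def sample_patch(image, patch_sizes, rng):
    """Crop a random square patch and append two coordinate channels.

    image: H x W x C nested lists. Returns (patch, (top, left, size)),
    where each patch pixel has C + 2 channels: the original values plus
    the pixel's (x, y) position in the full image.
    """
    H, W = len(image), len(image[0])
    p = rng.choice(patch_sizes)        # randomized patch size (assumed uniform)
    top = rng.randrange(H - p + 1)     # random location that fits inside the image
    left = rng.randrange(W - p + 1)
    patch = []
    for i in range(top, top + p):
        row = []
        for j in range(left, left + p):
            # Position normalized to [-1, 1]; this range is an assumption.
            y = 2.0 * i / (H - 1) - 1.0
            x = 2.0 * j / (W - 1) - 1.0
            row.append(list(image[i][j]) + [x, y])
        patch.append(row)
    return patch, (top, left, p)

rng = random.Random(0)
H = W = 16
image = [[[0.0, 0.0, 0.0] for _ in range(W)] for _ in range(H)]  # dummy RGB image
patch, (top, left, p) = sample_patch(image, [4, 8, 16], rng)
print(len(patch), len(patch[0]), len(patch[0][0]))  # size, size, channels
```

In a training loop, a patch like this would be noised and passed to the score network in place of the full image; when the sampled size equals the image size, the crop covers the whole image and the coordinate channels span the full range.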
Full Text Available
Signal Processing for Implicit Neural Representations

Xu, Dejia; Wang, Peihao; Jiang, Yifan; Fan, Zhiwen; Wang, Zhangyang (November 2022, Advances in neural information processing systems)

Full Text Available
